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air  to  air-combat  analysis,  differential  games,  pursuit/evaalon,  threat-reciprocity 
reprisal  strategy. 


A  preference-ordered  discrete -gaming  air-to-air-combat  model  Is  described.  Ap¬ 
proximately  optimal  switchings  between  several  closed-loop  control  policies  is  provide* 
by  an  active-cell  structure  In  the  state  space,  which  Is  divided  Into  regions  of  draw  out 
come  and  various  types  of  capture.  Cell  buildup  technique  is  discussed  and  the  results 
of  computations  for  families  of  duels  presented.  A  threat-reciprocity  scheme  for 
characterizing  draw  outcomes  and  tactics  Is  discussed.  A  reprisal -strategy  scheme 
for  exploiting  opponent's  errors  by  extrapolation  la  described. 
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FOREWORD 


The  research  presently  reported  was  supported  by  USAF  Flight  Dynamics 
Laboratory,  Wright -Patterson  AFB,  Ohio  under  Contract  F33615-77-C-3144 
over  the  period  20  September  1977  to  20  May  1979.  The  technical  monitors 
were  Dr.  L.  Earl  Miller  and  Mr.  David  Johnson  of  AFFDL/FXG.  The  research 
Is  a  continuation  of  that  reported  In  Ref.  1,  Initiated  In  an  effort  for  USAF  Wea¬ 
pons  Laboratory,  Klrtland  AFB,  New  Mexico  (AFWL/PGA). 
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SYMBOLS 


horizontal  position  components,  Vehicle  #1 
horizontal  position  components.  Vehicle  #2 
speeds  of  Vehicles  #1  and  #2 
heading  angles.  Vehicles  #1  and  #2 
turn  rates,  Vehicles  #1  and  *2 
navigation  ratio 

polar  coordinates  of  Vehicle  #1  relative  to  Vehicle  #2,  defined  by 
eqs.  (10)  and  (11) 

weapon  angular  tralnablllty  limit 

extended-weapon -envelope  functions  for  capture  by  player  #1  and 
player  #2.  See  Fig.  1 

generalized  miss  quantities  defined  by  eq.  (13) 
generalized-miss  difference  -  Q2 
interpolation  scalar 

normalized  state  Increments  employed  for  Interpolation  in  cell 
structure 

switch-surface  Indices 

game-of-two-cars  miss  estimate  defined  by  eq.  (14) 

altitude 

velocity 

v2 

specific  energy  h  ♦  — 
path  angle  to  horizontal 
bank  angle 
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throttle  setting 


lift 

drag 

angle  of  attack 

aerodynamic  limit  on  lift  coefficient,  Mach-number  dependent 

llft-coefficlent  limit,  the  lesser  of  aerodynamic  and  structural 
limits 


SUMMARY 


Results  are  presented  for  the  development  of  a  preference-ordered  discrete¬ 
gaming  alr-to-alr-combat  model.  A  parameterized  model  was  employed  featuring 
gaming  choice  between  several  closed-loop  control  policies,  approximately  optimal 
switchings  between  policies  being  provided  by  an  active-cell  structure  In  the  state 
space,  which  Is  divided  Into  regions  of  two  types  of  draw  and  three  types  of  capture. 
The  technique  for  building  up  the  aggregation  of  active  cells  received  much  attention. 
A  cell  structure  was  adopted  In  which  cells  In  the  state  space  are  partitioned  In 
terms  of  elapsed  time  from  the  Initiation  of  an  encounter,  this  being  relevant  to 
control -policy  choice  through  the  definition  of  a  draw  outcome  in  terms  of  elapsed 
time.  The  buildup  technique  evolved  adds  one  cell  at  a  time  to  the  aggregation  of 
active  cells.  It  turns  out  that  few  additional  capture  cells  are  accumulated  past 
15  or  20  sec.  elapsed  time,  and  that  the  30  sec.  maximum  time  employed  In 
most  of  the  example  computations  may  have  been  excessive.  The  preference- 
ordered  gaming  technique  Is  described  In  Ref.  1,  generated  In  part  during  an  ante¬ 
cedent  effort  for  NASA -Ames  and  AFWL.  Much  of  the  material  of  Ref.  1  has  been 
included  In  the  present  report  In  the  Interest  of  a  reasonably  self-contained  pre¬ 
sentation. 

The  body  of  this  report  contains  sections  dealing  with  cell  buildup  technique, 
examples  of  preference-ordered  scoring  of  the  game  matrix,  graduation  of  draw 
outcomes  via  a  threat-reciprocity  concept,  a  reprisal -strategy  scheme  which  ex¬ 
ploits  opponent’s  errors  by  extrapolation,  choice  of  altitude  for  3-D  modeling  and 
a  description  of  point-mass  simulation  model  making  use  of  data  from  gaming 
models  for  control  logic. 
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SECTION  I 


INTRODUCTION 


A  capability  has  evolved  for  the  solution  of  pursuit/evasion  differential 
games  with  vehicle  models  sufficiently  realistic  to  be  of  some  practical  interest. 
However,  the  literature  focuses  almost  entirely  on  the  case  of  an  evader  unde¬ 
fended  save  for  evasive  maneuvering,  and  has  only  limited  applicability  to  air 
combat  between  aircraft  of  roughly  equal  capability.  One  approach  to  this  situa¬ 
tion  which  has  received  some  attention  is  the  calculation  of  outcomes  with  pursuit 
and  evasion  roles  assigned  and  again  with  them  interchanged,  then  the  use  of  these 
to  define  surfaces  separating  win,  lose,  and  draw  regions  (Refs.  2,3).  This  ap¬ 
proach  leads  to  ambiguous  results  with  large-envelope  weaponry,  which  produces 
a  large  mutual -capture  region  when  the  assumption  of  pursuit  irrespective  of  the 
consequences  is  maintained.  A  technical  approach  to  prelim Irjary  maneuvering  in 
the  face  of  possible  draw  and  or  mutual  capture  has  been  examined  conceptually 
In  Ref.  4  and  is  developed  in  a  preliminary  way  in  the  presently  reported  effort. 

Present  focus  is  on  analyzing  the  first  few  seconds  of  a  one-on-one  air 
engagement  using  rather  crude  physical  modeling  and  control  logic,  but  em¬ 
phasizing  rational  choice  between  strategies,  according  to  player  preference 
among  outcomes:  win,  loss,  mutual  capture,  purposeful  disengagement,  draw. 
The  preference-ordering  formulation  is  due  to  M.  Falco  (Ref.  5).  The  approach 
presently  explored  cannot  be  expected  to  be  very  satisfactory  for  protracted  air 
duels,  e.g. ,  "dogflghtlng"  with  gun  armament;  it  is  a  creature  of  large-envelope 


weaponry. 


SECTION  n 


VEHICLE  MODELING 


Planar  motion  of  two  constant-speed  vehicles  is  described  by  the  equations: 
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Steering  by  proportional  navigation  with  bounded  turn-rate  provides  aggressor 
options: 

Wj  ”  -  ui  l  if  u»*  *  -  tei 

a  ui*  if  s  Wj 

”  Wj  if  Wj  <*i| 

wj  -  ce 

and  C  >  0  is  the  so-called  navigation  ratio. 

A  defensive  option  is  provided  by  turn-and-dash-away  guidance,  which 
effects  a  turn  away  from  the  sightline  direction,  bringing  the  vehicle's  velocity 
vector  anti -parallel  to  the  sightline.  In  this  case,  «•  ,  appearing  in  (7),  Is 
given  by 
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SECTION  in 


CAPTURE  CRITERIA 

Weapon  envelopes  Idealized  In  terms  of  a  radius  limit  r  and  an  angular 
tralnabtllty  limit  a  are  assumed  In  the  Illustrative  examples  to  be  presented, 
although  the  computational  approach  also  lends  Itself  to  more  realistic  envelopes. 
Thus,  Vehicle  #1  captures  Vehicle  #2  when  r*  r,  and 

-  (  sin  8  sin  \  ♦  cos  8  cos  \  )  *  cos  (12) 

The  description  of  Vehicle  #2's  weapon  envelope  employs  a  similar  form;  how¬ 
ever,  the  example  computations  will  employ  a  composite  of  circular  segments 
for  two  weapons  of  differing  lethal  radii  and  angular  limits  for  Vehicle  #2. 
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SECTION  IV 


PREFERENCE  ORDERING 


If  both  aircraft  are  manned  and  only  one  (Vehicle  *1)  has  the  possibility  of 
disengagement  upward  by  virtue  of  higher  celling,  reasonable  preference  orders 
are  1 ,4, 5, 3, 2  for  V ehlcle  #  1,  and  2, 5, 4, 3,  l  for  Vehicle  *2,  where  the  scoring 
Is 

1  :  1  captures 

2  :  2  captures 

3  :  mutual  capture 

4  ;  1  disengages 

5  :  draw 

In  the  interest  of  more  nearly  unique  determination  of  optimal  control  parameters, 
it  Is  advantageous  to  adopt  graduated  preferences  within  each  category,  thus: 
minima*  time  in  1,  maximin  time  In  2.  maximum  time  In  3,  minlmax  time 
In  4,  minlmax  energy  difference  (In  rough  estimate)  in  5. 
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SECTION  V 


DRAW-SPACE  INDICES 


Control  logic  for  maneuvering  In  the  draw  region  has  received  Insufficient  at¬ 
tention  In  the  2-D  modeling  of  Ref.  I  and,  for  that  matter.  In  air-combat  modeling  in 
general.  The  threat -reciprocity  concept  of  Ref.  4  is  of  Interest  In  this  connection, 
and  Is  developed  further  In  the  following.  An  appropriate  measure  of  generalized 
mlaa  Is  the  extended-weapon-envelope  Idea  of  Roberts  and  Montgomery  (Ref.  6)  Illus¬ 
trated  In  Fig.  1.  This  offers  miss  measures  suitable  for  use  with  the  discrete¬ 
gaming  model,  the  minima  versus  time  of  the  generalized  misses  of  the  two  combatants 
furnishing  data  for  a  scoring  Index  In  the  draw  region. 

The  functions  and  Q0  of  the  Joint  state  describe  extended  weapon  en¬ 
velopes  for  the  two  players.  *  0  corresponds  to  the  capture  envelope  of  player 
*1  (Fig.  1);  the  function  to 

Q  =  min  Q.(t)  (13) 

1  0*t*tf  1 

The  generalized  misses  Q  and  Q  are  defined  for  any  control  histories,  not 

1  « 

necessarily  optimal.  In  particular,  they  are  defined  for  the  trajectory  pairs 
corresponding  to  each  matrix  element  for  which  the  outcome  Is  a  draw. 

The  following  examination  of  control  rationale.  In  the  context  of  the  pref¬ 
erence  ordering  which  Is  appropriate  to  combat  between  manned  vehicles.  Is  due 
to  Eugene  Cliff  of  Virginia  Polytechnic  Institute.  The  assumed  preference  ordering 
Is  1,5, 3, 2  for  player  #1  and  2, 5, 3,1  for  player  #2.  A  1  outcome  denotes  cap¬ 
ture  by  player  #1,  a  2  by  #2,  3  denotes  mutual  capture,  and  5  a  draw.  The 
special  type  of  draw  designated  4  In  Ref.  1  will  figure  In  the  first  example  to  be 
presented,  but  not  In  the  second. 
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If  Q2»  0,  player  *1  can  be  aggressive  and  minimize  ;  however,  if 
Q.,  a?  0  ,  then  player  #1  must  evade  and  maximize  Q2  .  If  »  o  ,  then  *2 
can  be  aggressive  and  minimize  Qo  ;  however,  if  as  0,  then  player  *2  must 
evade  and  maximize  .  Thus,  there  are  four  possible  qualitative  "states'' 
and  associated  desires  within  the  draw  region. 

A  Q  »  0  and  Q  »  0 

L  *• 

#1  will  min  Qj 
#2  will  min 

B  Q,  »  0  and  Q„  a;  0 

•  1  will  max  Q0 
#2  will  min 

C  Q^O  and  Qo»0 

*1  will  min 
#2  will  max 

D  Q  a?  0  and  Q0  as  0 

#1  will  max  Q., 

#2  will  max 

This  subdivision  of  the  draw  space  is  shown  in  Ftg.  2,  which  also  shows  1,2, 
and  3  capture  regions.  In  "states"  B  and  C ,  zero-sum  game  theory  "works" 
in  that  the  players  agree  upon  an  objective,  each  in  opposition  to  the  other. 


The  threat-reciprocity  concept  blends  the  control  policies  In  the  draw  space 
smoothly  between  the  regions.  In  terms  of  the  relative  Importance  attached  to 
and  Q,?  In  each  player's  control  choice.  The  Implementation  adopted  In  the  pres- 
ently  reported  first  computational  attempt,  however.  Is  much  cruder  than  this. 
Essentially,  It  la  the  use  of  0  and  1  weights  In  regions  B,  C,  and  D,  as  noted 
In  the  preceding  listing,  and  the  use  of  minimax  Q*s  Q^-  In  region  A.  The 
threshholds  were  set  somewhat  arbitrarily  at  Q=*  1.2  . 
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SECTION  VI 


DISENGAGEMENT 


The  possibility  of  a  deliberate  disengagement,  as  distinct  from  an  Incon¬ 
clusive  draw,  may  be  provided  for  under  appropriate  circumstances.  One  such, 
which  is  compatible  with  the  present  modeling,  is  the  attainment  of  a  large  enough 
energy  advantage,  which,  taken  together  with  a  substantial  superiority  in  ceiling, 
permits  escape  upward.  In  the  first  example  to  be  presented,  Vehicle  #1,  which 
has  less  effective  weaponry,  has  higher  performance  and  the  capability  of  dis¬ 
engaging  upward  by  zooming,  given  sufficient  specific  energy,  with  the  possibility 
of  mounting  a  second  attack  subsequently. 

Disengagement  estimates  are  made  very  roughly,  without  actually  integrating 
specific  energy  changes,  by  assuming  that  |  w  )  *  uj  along  the  entire  trajectory. 

This  is  actually  a  fairly  good  assumption  in  the  single-pass  scenario  of  the  present 
example.  The  choice  of  turn-rate  bound  involves  possible  sacrifice  of  energy  rate 
for  increasing  turn  rate,  on  account  of  maneuvering -drag  build-up.  Thus,  a  choice 
of  the  bounds  and  Implies  a  time  at  which  disengagement  is  possible,  unless 
capture  by  Vehicle  *2  has  occurred  earlier.  Any  less  favorable  outcome  past  this 
time  is  scored  as  a  disengagement. 
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SECTION  vn 


CELL  BUILDUP  TECHNIQUE 


A  measure  of  optimization  may  be  provided  wtthtn  the  confines  of  the 
parameterization  adopted,  and  other  "coarseness-of-mesh"  limitations,  by  use 
of  Kopp's  "backing -up"  Idea  (Ref.  8).  Switching  between  guidance  modes  and 
parameters  Is  permitted  both  players  at  preset  Intervals.  Optimal  guidance 
choices  are  stored  In  cellular  subdivisions  of  the  joint  state  space  (Ref.  5). 

Some  optimal  guidance  choices  are  first  determined  for  cells  neighboring  the 
target  set  by  Integration  forward  In  time  of  short  trajectory  pairs,  originating 
from  their  midpoints.  These  trajectories  are  terminated  after  three  seconds. 
They  are  then  scored  as  described  In  the  preceding  section.  It  can  be  argued 
that  the  preference-ordered  matrix  choice  Is  optimal  within  the  chosen  parame¬ 
terization,  If  the  trajectories  are  sufficiently  short  and  the  cellular  mesh  Is 
sufficiently  fine.  The  optimal  guidance  choice  Is  then  stored  for  use  subsequently 
during  passage  of  trajectories  through  such  a  cell,  now  termed  an  "active"  cell. 

The  data  of  the  first  three  tables  presented  In  the  following  section  were 
obtained  without  the  use  of  an  active-cell  structure  —  no  guidance  switchings. 

The  results  of  Tables  4  and  5  were  obtained  with  a  cell  structure  comprised  of 
250  cells  In  the  3 -space ,  each  divided  Into  ten  subcells  according  to  time  re¬ 
maining  until  attainment  of  the  specified  30  sec.  maximum  time.  A  draw  out¬ 
come  from  a  matrix  scored  with  short  trajectories  Is  used  to  activate  only  the 
short  ttme-to-go  subcells. 

The  simplified  energy  bookkeeping  used  for  figuring  disengagements  Is  not 
compatible  with  the  use  of  active-cell  structure  because  the  more  Intricate  book¬ 
keeping  version,  to  which  the  combination  leads,  amounts  to  rough  estimation  of 
energy -modeled  disengagements  —  five-state  Instead  of  three-state.  Some  ex¬ 
perimentation  of  this  kind  has  been  done;  however,  disengagements  were  figured 
without  guidance  switchings  (n  the  results  to  be  presented,  l.e. ,  the  active-cell 
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guidance  overridden.  All  disengagements  are  assumed  to  be  initiated  at  time 
zero  and  are.  In  essence,  decisions  not  to  engage.  This  seems  reasonable  for 
scenarios  in  which  both  participants  employ  gaming  logic,  precluding  blunders, 
although  It  would  likely  be  unsatisfactory  In  other  settings. 

The  variables  of  the  joint  state  space  are  the  separation  distance,  r  ,  and 
the  heading  angles  of  the  two  vehicles  measured  from  the  line  of  sight.  Each  cell 
is  divided  into  subcells  according  to  time  remaining  until  attainment  of  the  specified 
maximum  time:  thus,  in  the  first  example,  there  are  ten  subcells.  Development  of 
cell  buildup  technique,  since  the  work  of  Ref.  1  was  carried  out,  has  resulted  In  sig¬ 
nificant  gains  in  computational  efficiency  and  considerable  simplification.  The  fol¬ 
lowing  description  applies  to  the  improved  procedure,  which  employs  a  more  rigid 
discretization  than  the  former  one:  changes  In  guidance  are  Implemented  at  three- 
second  intervals  and,  similarly,  captures  are  scored  only  at  the  mesh  points.  A 
major  effect  of  the  change  is  that  trajectories  generated  in  buildup  calculations  al¬ 
ways  find  themselves  in  active  subcells  after  the  first  three-second  Interval,  and 
none  need  be  discarded  for  passing  through  neutral  subcells. 

The  cell  buildup  starts  with  three-second  trajectories  originating  from  the 
midpoints  of  target-set  cells  In  order  to  determine  which  correspond  to  mutual  cap¬ 
tures.  A  dozen  trajectories  are  generated  from  each,  corresponding  to  various 
guidance  combinations,  as  described  in  Ref.  1.  The  matrix  scoring  decides  whether 
the  subcell  has  a  single-capture  outcome  or  a  mutual -capture  outcome  (determined 
by  the  occurrence  of  a  second  capture  within  three  seconds).  Target-set  cells  In  a 
region  of  overlap  of  the  two  target  sets  can  be  scored  without  trajectory  Integrations, 
as  the  mutual -capture  outcome  Is  obvious  at  the  outset.  Such  cells  have  the  3  out¬ 
come  (mutual  capture)  In  all  ten  subcells  and  no  guidance  choice  stored,  as  the  out¬ 
come  is  instantaneous  and  independent  of  guidance.  For  other  target -set  cells  whose 
scoring  results  in  a  mutual -capture  outcome  within  the  three  seconds,  the  subcells 
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are  activated  with  3  outcomes  plus  whatever  guidance  combination  emerged  from 
the  matrix  scoring.  Next,  a  dozen  three-second  trajectories  are  run  out  of  each 
cell,  starting  at  time  27  sec. ,  and  scoring  done  via  the  matrix  procedure.  The 
trajectory  emerging  from  the  matrix  scoring  is  termed  a  "lead  trajectory".  Tra¬ 
jectory  integrations  are  continued  until  the  maximum  duration,  30  sec.  ,  If  no 
capture  has  occurred  by  then,  or  until  capture  plus  three  seconds,  In  order  to  de¬ 
termine  whether  there  Is  a  mutual  capture.  Thus,  some  nominally  three-second 
trajectories  are  actually  six,  etc.  The  continuing  buildup  employs  progressively 
longer  trajectories:  six-second  trajectories  starting  at  24  sec. ,  etc.  When  a 
subcell  Is  activated  with  a  capture  outcome,  the  same  outcome  and  guidance  are 
used  to  generate  all  of  the  longer-tlme-to-go  subcells  of  the  particular  cell. 

Active-cell  data  are  ordinarily  presented  in  three  arrays  of  computer 
printout.  The  first  two  arrays  relate  cell  number  and  cell  content.  The  third 
array  locates  the  cell  In  state  space.  The  content  of  each  subcell  is  represented 
by  three  digits,  of  which  the  first  two  are  guidance  ( 1  through  12)  and  the  last 
is  the  outcome.  The  shortest -tlme-to-go  subcell  data  appears  on  the  left  of  the 
word  In  the  second  array,  the  longest  on  the  right  In  the  first  array.  Thus,  the 
printout  for  cell  #162 


091  091  091  091  091 


in  the  first  array,  and 


015  015  015  091  091 

In  the  second.  Indicates  that,  for  the  three  shortest -time -to -go  subcells,  the 
outcome  Is  5  (a  draw)  and  the  guidance  Is  1  ,  while  for  the  seven  longest-tlme- 
to-go  subcells,  the  outcome  is  a  1-capture  and  the  guidance  9  . 
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The  entry  in  the  third  array 


1  0  9  0  5 

indicates  by  its  first  digit,  1  ,  that  the  separation  between  the  two  aircraft  lies 
between  0  and  .  75  n.  ml.  for  the  particular  cell.  The  next  two  digits,  0  9, 
indicate  that  the  function  tr  -  \  ♦  0  of  the  first  vehicle's  heading  relative  to  the 
sightline  direction  lies  between  60°  and  90°.  The  last  two  digit*,  0  5,  de¬ 
note  that  -  0  lies  between  90°  and  190°. 


SECTION  vm 


CELL  INTERPOLATION  LOGIC 


The  active-cell  representation  of  guidance,  described  In  the  foregoing,  forces 
guidance  switchings  to  take  place  at  cell  Interfaces,  and  produces  a  somewhat  jagged 
approximation  to  a  switching  surface  in  the  three-space.  An  Improved  representation 
using  Interpolation  between  cell  midpoints  Is  described  In  the  following.  The  tech¬ 
nique  has  not  been  evaluated  computationally  and  is  described  only  for  completeness 
and  as  a  possible  program  growth  item. 

If  the  cell  and  its  neighbor  are,  for  example,  deep  in  the  draw  space,  the 
appropriate  interpolation  index  is  Q*=  Q,  -  Q  .  If  the  guidances  of  the  tw-o  cells  are 

A 

row-different  or  column-different,  the  switching  crossover  may  be  located  in  linear 
approximation  by  equating  the  linearly  interpolated  representation  of  Q*  for  the  two 
matrix  elements  in  question,  producing  an  interpolation  scalar  £,  0*  (si,  which 
Is  the  fraction  of  the  distance  between  midpoints  at  which  switching  should  occur. 

If  the  guidances  are  both  row-different  and  column-different,  the  presence  of  two 
intersecting  switching  surfaces  is  Implied.  These  are  considered  separately  and 
the  two  {  values  averaged,  in  the  approximation  of  present  interest. 

The  computation  of  {  values  requires  availability  (storage)  of  the  matrix 
elements  and  associated  graduation  indices.  The  storage  requirement  is  substantial 
but  temporary,  hence  the  interpolation  is  done  at  the  conclusion  of  each  tlme-to-go 
"layer"  during  the  buildup  of  active  cells.  Since  each  cell  has  as  many  as  six 
neighbors,  permanent  core  storage  is  provided  for  six  {  values,  each  rounded  to 
three  decimal  digits'. 

If  Zj ,  l»l,  —  ,  3  ,  are  cell-variable  increments  measured  from  the  mid¬ 
point  of  the  cell  containing  the  current  state-point,  normalized  to  the  distance  between 
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the  midpoint  of  the  cell  and  the  appropriate  neighbor,  then 


Is  useful  for  determining  whether  the  guidance  of  the  reference  cell  or  that  of  one 
of  its  neighbors  should  be  used.  Three  of  the  six  £  values  stored  are  selected 
according  to  the  signs  of  the  components  of  z,  for  use  in  computing  the  p(  .  The 
p.  value  is  taken  as  zero  where  there  is  no  neighbor  or  where  the  neighbor  has 
the  same  guidance  as  the  reference  cell.  If  more  than  one  of  the  three  neighbors 
of  interest  has  the  same  guidance,  the  p  value  for  the  two  or  three  is  determined 
jointly  as 


k  <3 


I  T 

1  =  1  J 


which  accounts  for  angling  of  the  switch  surface  to  the  cell  mesh.  If  all  cell  p. 
calculated  are  p^  •»  1  ,  reference-cell  guidance  is  used.  If  at  least  one  p.  >  I  , 
the  guidance  of  the  neighbor  corresponding  to  the  largest  p.  is  used. 

Interpolation  indices  are  employed  as  follows,  according  to  the  pairing  of 
outcomes  in  the  reference  cell  and  the  neighbor  under  examination: 


1  &  1  time-to-capture 

1  &  2  Q* 

1&3  Q2 

1  &  5  01 

2  &  2  time-to-capture 

2  &  3  Q 

2  &  5  Q2 

3  &  3  time-to-capture 

3  &  5  Q* 

5  4  5  Q* 
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The  interpolation  between  active-cell  midpoints  Is  intended  to  produce  the 
equivalent  of  a  finer  cell  mesh,  as  far  as  steering  errors  are  concerned,  or, 
alternatively,  permit  the  use  of  fewer  cells  for  the  same  magnitude  of  error. 
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SECTION  IX 


DESCRIPTION  OF  EXAMPLES 

In  the  first  example  to  be  described,  Vehicle  *1  has  conventional  weaponry, 
—  o 

r  *  2  n.  miles,  and  a  =  10  .  Vehicle  *2  has  Identical  weaponry,  but  in  addition 
has  wide-angle  special  weaponry,  r  =  1.5  n.  miles,  and  a  -  6cP.  Vehicle  #1,  how¬ 
ever,  has  slightly  higher  turn  rates,  both  maximum  Instantaneous  and  sustainable, 
and  more  favorable  energy  rate  at  any  given  turn  rate.  Vehicle  *1  is  allowed  a 
four-way  choice:  proportional  navigation  with  any  of  three  u;  values,  maximum 
instantaneous,  maximum  sustainable,  and  zero:  dash-away.  Vehicle  *2,  who  must 
win  early  or  never,  is  restricted  to  choice  between  proportional  navigation  with 
maximum  instantaneous  and  maximum  sustainable  rates,  and  dash-awav.  The  en¬ 
gagement  takes  place  with  both  craft  at  a  specific  enengy  of  31,000  ft.  Speeds  are 
around  1000  ft. /sec.  ,  varying  somewhat  with  choice  of  u: .  The  aircraft  are  those 
of  Ref.  7.  Of  the  twelve  combinations  of  control  options,  five  permit  disengagement 
at  various  times  before  the  30  sec.  cut-off  that  denotes  a  draw  if  no  capture  has 
taken  place. 

In  the  second  example,  the  weaponry  is  the  same  as  that  Just  described, 
however,  the  performance  and  maneuverability  characteristics  are  those  of  a 
different  aircraft,  a  version  of  the  F-5,  and  the  opponents  are  identical. 


SECTION  X 


SCORING 


For  a  given  set  of  Initial  conditions,  the  trajectories  are  calculated  twelve 
times,  once  for  each  combination  of  control  parameters,  and  the  results  arranged 
In  matrix  form,  thus: 


V. 


max  toward 

med  toward 

turn -and- 
dash -away 


// 


5  1 
2 

5  J 


Here  "toward”  implies  proportional  navigation. 


Each  matrix  Is  scored  twice,  once  with  what  might  be  called  a  mintroax 
rule,  again  with  a  maxlmln  rule.  The  first  player  chooses  the  column,  the  second 
the  row,  according  to  his  preference  order.  In  minimax.  the  first  player  is  as¬ 
sumed  to  have  chosen  first;  for  each  choice  of  column,  the  row  choice  is  given  by 


[  3  3  2  2  ] 


The  first  player,  anticipating  this,  would  choose  the  outcome  3,  the  rainlmax 
score.  In  maxlmln,  the  second  player  Is  assumed  to  have  chosen  the  row  first. 
For  each  choice  of  row,  the  preference-ordered  choice  of  column  by  the  first 
player  results  in 


5 

3 

.  1 


The  second  player's  choice  of  row,  in  anticipation  of  this,  is  the  outcome  5, 
the  maJtlmin  score. 

The  terms  minimax  and  maximin  are  employed  very  loosely  here,  as  the 
game  of  interest  is  not  naturally  zero-sum  unless  player  «*l’s  preference  order  is 
the  opposite  of  player  f2's  ,  and  this  does  not  arise  naturally  in  familiar  battle 
settings. 
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SECTION  XI 


GAME-SCORING  ILLUSTRATIONS 


Examples  were  sought  which  exhibit  a  variety  of  outcomes  In  the  scoring 
matrix,  yet  do  not  have  too  many  confusing  guidance  switchings  brought  about  by 
the  active-cell  structure.  It  turns  out  that  such  cases  are  scarce  and,  finally,  a 
six-second  limit  was  Imposed  upon  engagement  duration  In  order  to  provide  one. 

A  second  example,  more  Interesting  but  slightly  more  complex,  also  features  a 
six-second  limit. 

The  first  case  chosen  has  the  participants  separated  by  Just  under  2  n.  m. 
at  the  outset,  each  In  the  other's  forward  hemisphere.  The  maneuvers  corres¬ 
ponding  to  the  various  elements  In  the  matrix  of  Initial  guidance  choices  are 
sketched  In  the  accompanying  figure.  The  matrix  Is 

‘  2  2  2  2  " 

3  2  2  2 

.1  5  5  5. 

Here  1  ■  one-capture,  2  *  two-capture,  3  ■  mutual  capture,  4  •  disengagement, 
and  5  -  draw.  No  4’s  occur  on  account  of  the  short  duration. 

The  maneuvers  corresponding  to  each  element  of  the  matrix  are  noted  on 
each  of  the  dozen  sketches  of  Fig.  3 .  There  Is  nothing  subtle  about  the  Interpreta¬ 
tion:  *2’s  best  maneuver  Is  a  maximum-rate  turn  toward  #1 ,  which  effects  cap¬ 
ture  early  no  matter  what  maneuver  #1  chooses  (top  row).  Should  #2  turn  less 
vigorously  (middle  row),  #l's  best  maneuver  Is  a  maximum-rate  turn  toward 
#2 ,  and  this  results  In  a  mutual  capture.  An  evasive  turn  away  from  his  oppo¬ 
nent  is  a  blunder  for  *2  ,  as,  in  this  event,  #1  captures  via  his  best  maneuver, 
a  maximum -rate  turn  toward  #2  (bottom  row). 


Matrix  scoring  Is  carried  out  according  to  the  preference  orders  1,  4, 

5,  3,  2  for  #1  and  2,  5,  4,  3,  1  for  #2.  The  general  Idea  Is  similar  to  the 
"worst  case"  concept  familiar  from  system  design,  although  the  closed-loop- 
control  aspect,  which  makes  the  problem  Into  a  game,  Introduces  subtle  dif¬ 
ferences.  None  of  these  complicate  the  particular  example,  however. 

#1  chooses  the  column  of  the  matrix  and  #2  the  row,  each  assuming 
the  worst,  from  his  own  viewpoint,  about  the  other's  choice.  A  choice  of  a  ma¬ 
trix  element  amounts  to  a  particular  selection  from  a  four-way  guidance  choice 
by  4 1  and  from  a  three-way  choice  by  *2.  Actually,  it  is  only  am  initial  selec¬ 
tion;  switchings  in  guidance  take  place  as  the  point  in  joint  state  space  moves 
through  the  active-cell  structure.  In  the  example,  no  switchings  occur,  except 
in  the  draw  outcomes. 

Looking  at  the  matrix  of  outcomes,  *1  reasons  thus;  If  *2  turns  away, 

I  may  choose  between  the  outcomes  of  the  bottom  row  and  obtain  a  one-capture 
by  opting  to  turn  sharply  toward  *2  ;  if  #2  turns  toward  me  at  medium  turn  rate, 
my  choice  of  outcomes  in  the  middle  row  is  again  sharply  toward,  which  effects 
a  mutual  capture;  If  *2  turns  toward  me  at  maximum  rate,  I  am  captured  no 
matter  what  and  choose  my  guidance  for  the  longest  time  to  capture  In  the  top  row 
(unfortunately,  due  to  discretization,  they  are  all  two-second  captures,  and  the 
choice  of  first  column  is  made  arbitrarily). 

#2  can  think  through  this  before  choosing  the  row;  he  Is  thus  choosing 
between  the  elements  of  a  column  vector 

2 

3 

.  1  . 
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and  the  top  element  Is  his  choice,  this  corresponding  to  s  choice  of  top  row  In 
the  matrix.  The  score  of  the  matrix  Is  2  ,  provided  by  the  preference-ordered 
choice  of  first  row,  first  column,  which  corresponds  to  both  guidance  choices  of 
turn  max  toward.  It  happens  that.  In  this  particular  engagement,  the  wide  angle 
weaponry  never  comes  Into  play.  #2's  win  is  due  to  a  slight  advantage  In  geometry 
at  the  beginning  of  the  engagement. 

A  second  example  has  the  scoring  matrix 

'  3  3  2  2 

3  3  2  2 

.1  1  1  1  . 

The  trajectories  are  shown  In  Fig.  4.  The  outcome  Is  mutual  capture  provided 
by  second-row,  second-column  Initial  guidance  choices.  Although  the  confronta¬ 
tion  Is  not  head-on.  It  has  the  elements  of  a  game  of  "chicken",  made  lethal  by 
the  reach  of  weaponry.  If  either  vehicle  turns  away  initially,  and  the  other  does 
not,  the  player  turning  away  Is  captured.  If  both  turn  away  Initially,  as  In  the 
lower  right  sketch,  the  geometrical  situation  shortly  later  favors  #1 ,  who  cap¬ 
tures  Irrespective  of  further  evasion  by  #2 .  The  active-cell  guidance  adopted 
subsequently  by  #2  merely  prolongs  the  duration  to  capture.  In  the  engage¬ 
ment  under  discussion,  #2's  wide -angle -weapon  coverage  Is  a  decisive  factor. 
Guidance  switchings  arising  from  passage  of  the  trajectory  pair  through  the  cell 
structure  are  Indicated  on  the  trajectories  by  tick  marks  followed  by  the  new 
guidance  combination  In  use,  the  numbers  1  through  12  denoting  the  matrix- 
element  combinations. 
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As  in  the  preceding  example,  the  first  player  chooses  the  column,  the  second 
the  row,  according  to  his  preference  order.  In  minimax,  the  first  player  is  as¬ 
sumed  to  have  chosen  first;  for  each  choice  of  column  by  #1  ,  the  choice  by  #2 
is  arrayed  in  a  row  as 

[  3  3  2  2  ] 

The  first  player,  anticipating  this,  would  choose  the  outcome  3  ,  the  minlmax 
score.  In  maximin,  the  second  player  Is  assumed  to  have  chosen  the  row  first. 

For  each  choice  of  row,  the  preference-ordered  choice  by  the  first  player  re¬ 
sults  in 


The  second  player,  in  anticipation  of  this,  chooses  the  outcome  3,  the  maxi¬ 
min  score.  The  terms  minimax  and  maximin  are  employed  loosely,  as  the  game 
is  not  naturally  zero-sum  unless  player  -  I  s  preference  order  is  the  opposite  of 
player  »2's,  and  this  does  not  arise  naturally  in  familiar  battle  settings.  In  the 
interest  of  more  nearly  unique  determination  of  guidance  options,  graduated  pref¬ 
erences  are  adopted  within  each  category.  Thus;  minimax  time  in  1  ;  maximin 
time  in  2  ;  maximum  time  in  3  ;  reciprocity  in  5  according  to  the  scheme  of 
the  preceding  section. 

Close  examination  of  individual  encounters,  such  as  the  two  just  described, 
led  to  a  modeling  refinement,  viz. ,  a  shift  in  preferences  following  a  first  capture 
so  that  the  capturing  player  subsequently  maximizes  his  own  margin  against  cap¬ 
ture.  This  change  was  found  to  reduce  the  number  of  mutual  captures  in  a  family 
of  engagements  by  roughly  5  to  10%  .  The  refinement  is  not  Incorporated  In  the 
results  for  a  first  example  Immediately  following,  but  is  Incorporated  In  those  for 

a  second  example  to  be  presented  In  a  later  section. 
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SECTION  xn 


RESULTS  FOR  A  FIRST  EXAMPLE 


Fourth-order  Runge-Kutta  Integration  with  1  sec.  time  steps  were  used. 

The  Interval  of  3  sec.  Is  used  as  a  building-block  for  determining  the  active-cell 
structure.  Results  are  calculated  In  each  case  for  a  specified  Initial  separation  and 
an  assumed  uniform  distribution  of  both  vehicle  headings,  every  1^  around  the 
clock.  Combinations  of  headings  producing  Initially  positive  range  rate  are  screened 
out.  In  each  case,  a  total  of  335  engagements  are  calculated  and  scored. 


The  results  for  the  first  example  to  be  presented  Illustrate  the  effects  of 
gross  modeling  changes  and  were  calculated  without  the  active-cell  structure,  l.e. , 
with  no  switchings  between  guidance  laws,  save  for  a  switch  to  "breakaway"  guidance 
by  a  capturing  vehicle  In  an  attempt  to  avert  mutual  capture.  The  following  table  pre¬ 
sents  results  conveying  an  Idea  of  the  effectiveness  of  Vehicle  •2's  wide-angle  weapon 
the  first  column  assuming  the  weapon  inoperative,  the  second  operative,  both  Incor¬ 
porating  the  maneuverability  and  performance  penalties  Incidental  to  the  weapon 
Installation. 


TABLE  1 

EFFECT  OF  WIDE -ANGLE  WEAPONRY 


2  n.  ml. 

sep. 

Outcome 

Wide-angle  weapon 
Inoperative 

Wide-angle  weapon 
operative 

1  < 1  captures! 

130 

73 

2  (2  captures) 

52 

87 

3  (mutual  capture) 

29 

74 

4  (1  disengages) 

120 

95 

5  (draw) 

5 

6 

1 

As  mentioned  earlier,  It  Is  assumed  that  a  mutual  capture  takes  place  If, 
within  3  sec.  of  a  capture,  a  capture  of  the  opposite  type  takes  place,  this  ac¬ 
counting  roughly  for  lag  In  weapon  effectiveness  (e.  g.  ,  IR  missile  time-of-fllght). 
The  effect  of  reducing  this  3  sec.  mutual -capture  period  to  zero  Is  shown  In  the 
following  table. 

TABLE  2 

EFFECT  OF  MUTUAL-CAPTURE  PERIOD 


The  effect  is  seen  to  be  appreciable  and  warrants  careful  modeling. 


It  Is  noted  that  some  of  the  captures  In  these  and  other  results  occur  Initially, 
before  any  evasive  maneuver,  and  this  Is  quite  realistic;  In  fact,  reality  has  *  high 
incidence  of  such  encounters,  reflecting  the  advantage  of  surprise. 


Table  3  shows  the  effect  of  eliminating  the  turn-and-dash-away  guidance  op¬ 
tions  for  both  vehicles.  Elimination  la  seen  to  be  of  minor  Importance  at  an  Initial 
separation  of  2  n.  ml. ,  but  decisively  In  favor  of  the  wide -angle- weapon -equipped 
vehicle  at  i  n.  mi.  in  sharply  reducing  the  number  of  draws. 


2  n.  ml. 
sep. 

Outcome 

3  sec. 

mutual -capture 
period 

0  sec. 

mutual-capture 

period 

1  (1  captures) 

73 

115 

2  (2  captures) 

87 

121 

3  (mutual  capture) 

74 

13 

4  (1  disengages) 

95 

80 

5  (draw) 

6 

6 

TABL E  3 


EFFECT  OF  TURN-AND-DASH-AWA  Y  OPTION 


Separation 

2  n.  ml. 

4  n.  mi. 

Outcome 

turn-and- 

dash-away 

optional 

no 

turn -and- 
dash-away 

turn-and- 

dash-away 

optional 

no 

turn-and- 

dash-awav 

1  (1  captures) 

73 

76 

0 

0 

|  2  (2  captures! 

87 

37 

0 

42 

|  3  (mutual  capture) 

74 

i  1 

0 

21 

4  (I  disengages) 

95 

90 

272.5 

j  5  (draw) 

6 

5 

62.  5 

n 

Table  4  presents  results  with  an  active  cell  buildup  based  upon  the  mini- 
max  rule  in  comparison  with  corresponding  results  obtained  without  the  active 
cell  structure.  The  reduction  in  the  number  of  mutual  captures  is  of  particular 
Interest.  Results  vary  considerably  with  initial  separation,  as  shown  in  Table  5, 


TABLE  4 

EFFECT  OF  ACTIVE-CELL  STRl'CTCRE 


2  n.  mi. 

1  Outcome 

with  active 
cell  structure 

without  active 
ceil  structure 

1  (1  captures) 

39 

73 

2  (2  captures) 

109 

ST 

3  (mutual  capture) 

16 

74 

4  (1  disengages) 

127 

95 

5  (draw) 

45 

6 

obtained  employing  the  active-cell  structure.  The  pattern  of  strategies  emerging  is 
of  Interest.  For  large  initial  separations,  engagements  tend  to  be  head-on  should 
both  participants  be  aggressive,  but  this  leads  to  mutual  capture,  which  both  rate 
very  low  in  the  assumed  preference  order.  Hence,  the  outcomes  tend  to  be  mainly 
draws  and  disengagements.  For  very  large  separations,  Vehicle  #1  has  ample  time 
to  gain  energy  prior  to  engagement,  and  therefore  can  Insure  a  disengagement  Instead 
of  a  draw  in  a  preponderance  of  cases.  Large-separation  scenarios  are  realistic  only 
with  assumed  surface-radar  assistance  to  furnish  the  equivalent  of  the  required  visi¬ 
bility;  settings  in  which  both  vehicles  are  provided  such  assistance  are  relatively 
rare. 


TABLE  5 

EFFECT  OF  INITIAL  SEPARATION  DISTANCE 


^'~"~---^i>epa  rat  ion 
Outcome  ■ — 

1  n.  mi. 

2  n.  mi. 

— 

3  n.  ml. 

4  n.  mi. 

1  (  1  captures ) 

113 

38 

32 

2 

2  (  2  captures  ) 

197 

109 

69 

68 

3  (  mutual  capture ) 

19 

16 

9 

4 

4  (  1  disengages ) 

2 

127 

194 

195 

5  (draw) 

4 

45 

31 

66 

Close-in,  draws  and  disengagements  occur  less  frequently,  and  mutual 
captures  take  place  occasionally  in  spite  of  the  assumed  preference  ordering. 
The  wide-angle  weaponry  generally  produces  a  high  exchange  ratio  in  favor  of 
Vehicle  *2.  Dash-away  is  rarely  adopted  close-in  as  a  means  of  retreat  on  ac¬ 
count  of  vulnerability  to  the  2  n.  ml.  narrow-angle  weaponry  in  a  rectilinear 
tail  chase. 


The  bulk  of  computational  expense  turns  out  to  be  in  the  building  up  of  the 
active-cell  structure;  the  expense  of  evaluation  runs  is  relatively  small,  even  for 
a  large  family  of  engagements.  Expense  rises  sharply  versus  the  maximum  time 
of  engagement  specified,  however,  few  additional  capture  cells  are  accumulated 
past  15  or  20  seconds.  Probably  the  30-sec.  maximum  time  is  excessive  for 
applications  of  the  kind  typified  by  this  example. 
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SECTION  xra 


REPRISAL  GUIDANCE 


The  subject  of  differential  games  is  still  young  (Refs.  9, 10, 11).  but  much 
progress  has  already  been  made  on  Us  theory,  and  enough  done  with  simple  ex¬ 
amples  to  tempt  engineers  and  operations  analysts  Into  applications  work.  Ap¬ 
plications  thinking  has  been  dominated,  quite  properly,  by  the  problem  of  obtaining 
approximate  solutions,  either  by  simplification  of  system  models  or  of  solution 
procedures.  Little  attention  has  been  paid  so  far  to  the  exploitation  of  solutions 
already  In  hand  for  closed-loop  control  purposes.  The  present  section  looks  at 
the  possibility  of  removing  some  of  the  conservatism  Inherent  In  gaming  calcula¬ 
tions  In  pursuit  evasion  applications.  It  owes  much  to  discussions  between  the 
writers  and  Eugene  Cliff  ot  VPI. 

It  Is  basic  to  differential  gaming  that  each  participant  plays  optimally,  any 
departure  by  one's  opponent  being  assumed  momentary  and  not  worth  considering 
for  planning  purposes.  Control  policies  bared  upon  the  extrapolation  of  an  opponent's 
current  departures  from  optimality  are  sometimes  called  "reprisal  strategies", 
and  have  not  been  widely  studied.  One  extreme  Is  extrapolation  of  an  opponent’s 
trajectory,  under  some  plausible  assumption  about  his  controls,  for  purposes  of 
solving  one's  own  pursuit  (or  evasion)  problem  as  an  optimal -control  problem. 

The  other  extreme  attempts  to  exploit  a  real  or  perceived  advantage  In  system - 
delay  time  (the  sum  of  Information-processing  and  control  delays)  based  upon  short¬ 
term  considerations.  For  this,  there  Is  an  applicable  theory  of  upper  and  lower 
games  (Ref.  11).  Attention  Is  directed  In  the  following  mainly  to  the  practically 
Important  cases  intermediate  between  these  extremes. 

There  Is  an  obvious  additional  margin  to  the  Informationally-advantaged 
player  In  a  discretized  sequential  game,  such  as  the  present  one.  l.e. ,  to  the 
player  who  makes  his  choice  with  knowledge  of  his  opponent's  choice.  In  pursuit/ 
evasion  games,  and  In  other  differential  games  In  which  the  Hamiltonian  function 
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is  separable,  this  advantage  is  supposed  to  disappear  in  the  limit  as  the  discretiza¬ 
tion  time-increment  shrinks  to  zero  (Ref.  11).  It  appears,  however,  that  the  effect 
of  nonzero  time-step  may  be  substantial.  There  is  also  an  important  related  effect, 
viz.  ,  that  the  informationally-advantaged  player  must  play  closed-loop  to  retain  his 
advantage.  Otherwise,  the  use  of  open-loop-optimal  strategy  against  nonoptimal 
play  may  result  in  penalties.  The  effect  is  equivalent  to  that  of  successful  decep¬ 
tion  by  his  opponent. 

A  'reprisal''  guidance  policy,  i.e.  ,  one  which  attempts  to  exploit  perceived 
departures  from  optimality  by  one's  opponent  by  extrapolation  of  his  errors,  may 
tie  svntheslzed  via  minor  changes  in  the  cell  buildup  process.  If  one  assumes  that 
errors  will  p*  rslst  for  a  time  period  equal  to  the  basic  buildup  time  increment, 
for  example,  the  procedure  is  to  erase  the  top  layer  and  rebuild  it  as  many  times 
as  the  opponent  has  guidance  options,  in  each  instance  assuming  that  the  opponent's 
freedom  is  restricted  to  the  particular  option.  This  generates  several  top  layers, 
three  or  four  :n  the  example,  to  be  used  selectively  depending  upon  the  opponent's 
choice  of  control.  If  the  opponent  plays  optimally,  the  guidance  selected  is  optimal 
in  the  preference-ordered  sense  omploved  in  the  buildup.  The  risk  lies  entirely  in 
the  assumption  that  one  s  perception  of  the  opponent's  control  is  fast  enough  and 
accurate  enough  to  trust. 

Extrapolation  of  opponent's  errors  for  a  time  spanning  several  layers  of 
subcells  is  also  possible  and  attractive:  these  layers  are  simply  rebuilt  the  requisite 
number  of  times,  assuming  that  the  opponent's  guidance  is  locked  in  to  each  choice 
in  turn  for  the  entire  extrapolation  time-span.  In  this  case,  the  guidance  generated 
is  not  optimal  against  optimal  play;  one  cannot  have  everything.  However,  a  com¬ 
promise  suggests  itself,  viz.  ,  use  preference-ordered-optimal  guidance  when  one's 
opponent  is  playing  optimally  —  extrapolate  only  in  the  face  of  nonoptimal  play. 

Such  a  mix  might  be  called  a  minimax-reprisal  composite. 
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SECTION  XIV 


ILLUSTRATIVE  NUMERICAL  RESULTS  FOR  REPRISAL  GUIDANCE 


The  example  is  Identical  to  that  of  Ref.  1  except  that  the  speed  and  turn-rate 
characteristics  of  the  two  aircraft  are  equal  —  In  the  present  case  both  are  F-5'a  — 
and  the  disengagement  outcome,  a  special  type  of  draw,  does  not  appear  as  a  result 
of  this.  Aircraft  *1  has  a  narrow-angle  (*10°)  weapon  effective  out  to  two  nautical 

miles.  Aircraft  *2  has  an  Identical  weapon  and.  In  addition,  a  wide-angle  weapon 

o 

(60  semi -apex  angle)  with  a  reach  of  1.5  n.  mi.  There  are  250  cells  each  parti¬ 
tioned  into  10  subcells  of  3-sec.  time-remaining  increment.  A  draw  Is  declared 
If  no  capture  has  occurred  by  27  seconds.  The  family  of  encounters  for  this  ex¬ 
ample  consists  of  the  250  cases  initiated  from  the  cell  midpoints. 

Table  6  summarizes  results  obtained  for  this  family  first  under  the  assump¬ 
tion  that  both  players  make  mlnlmax  control  choices,  the  term  being  used  here  loosely 
to  drraate  preference-ordered-optimal.  These  results  indicate  the  sort  of  superiority 
for  *2  that  might  be  expected  as  a  result  of  his  weaponry  advantage. 

TABLE  6 

EFFECT  OF  GUIDANCE  OPTIONS 


Outcome 

Both 

minimax 

#2  hard-turning 
#1  reprisal  (3  sec.) 

*2  hard -turning 
*1  reprisal  (30  sec.) 

1  ( *1  captures) 

21 

18 

19 

2  ( #2  captures) 

91 

84 

68 

3  (mutual  capture) 

30 

39 

35 

5  (draw) 

108 

109 

128 

11 


Results  are  also  shown  for  two  cases  In  which  #2  Is  locked  into  hard- 

turning  guidance  toward  his  opponent.  There  is  a  shift  in  favor  of  #1  ,  reflected 

in  reduction  in  *2 -captures  ,  dramatic  for  the  case  of  long-term  extrapolation 

to  T  as  much  as  30  seconds.  The  reduction  in  #1 -captures  is  believed 
max 

attributable  to  coarseness-of-mesh  in  combination  with  the  fact  that  many  draws 
are  near-captures.  It  should  be  borne  in  mind  that  85  of  the  captures  occur 
initially,  in  the  capture  set:  10  #l-captures,  52  #2-captures ,  and  23  3-captures. 

Trajectory  comparisons  are  given  in  Fig.  5  for  two  cases  in  which  #1  gains 
by  long-term  extrapolation  (30-sec.  reprisal!.  In  Fig.  5A,  minimax  optimal  play 
results  in  a  2  capture  as  a  result  of  an  initially  gentle  turn  by  *2  producing  awk¬ 
ward  geometry  for  »1,  who  turns  away  but  cannot  subsequently  avoid  *2's  wide- 
angle  weapon.  In  the  case  of  hard-turning  steering  by  «2,  which  is  overly  aggressive 
in  the  circumstances,  *1  ,  by  turning  to  the  attack,  can  effect  a  mutual  capture.  In 
Fig.  5B,  rainimax  optimal  play  results  in  a  scoreless  tail  chase,  while  hard-turning 
to  the  attack  by  »2  proves  unwise,  resulting  in  a  1  capture. 
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SECTION  XV 


ALTITUDE  CHOICE  AND  "MAKING  USE  OF  THE  VERTICAL'" 


When  the  F-l  was  first  Introduced  Into  service,  It  flew  In  mock  combat 
against  the  older  F-8,  not  very  successfully  at  first.  Over  a  long  period,  literally 
years,  the  F-4  pilots  gradually  learned  how  to  take  advantage  of  the  airplane's 
ability  to  reach  higher  energies  than  the  F-8,  and  of  Its  generally  higher  energy 
rate  (specific  excess  power,  by  "making  use  of  the  vertical",  until  the  F-8's 
turn-rate  advantage  In  the  "normal"  altitude-airspeed  range  was  eventually  over¬ 
come  (Ref.12).  The  evolution  of  good,  or  even  Improved,  tactics  evidently  takes 
much  longer  than  does  training  to  execute  currently-recommended  tactics,  es¬ 
pecially  where  vertical-maneuvering  aspects  are  Important.  It  is  suspected  that 
the  limited  flight  and  simulator  mock-combat  experience  with  thrust-augmenled- 
llft  versus  relatively  high-powered  conventional  aircraft  suffers  from  a  fairly 
severe  case  of  this  same  ailment. 

The  preference -ordered  gaming  work  of  Ref.  1  was  Inspired  by  results  ob¬ 
tained  on  a  two-cockpit  simulator  In  the  study  of  Ref.  7,  which  explored  the  potential 
of  large -envelope  wide-angle  weaponry.  No  tendency  to  take  the  engagement  to  either 
extreme  of  altitude  was  noted  In  the  simulation  study,  and  this  Is  perhaps  attributable 
to  a  large  extent  to  the  briefing  Instruction:  "be  aggressive".  The  modeling  of  Ref.  1 
is  2-D,  except  for  deliberate  disengagement  to  a  high-energy  haven,  treated  on  a 
simplified  basis.  This  resulted  partly  from  the  investigation's  Initial  purpose,  viz., 
to  Illuminate  the  simulation  data,  and  was  due  partly  also  to  a  natural  inclination  to 
deal  with  the  simpler  problem  first,  deferring  an  attack  on  the  3-D  problem. 

In  the  following,  the  question  of  altitude  choice  will  be  taken  up  for  the  case 
of  role-determined  pursuit/evasion  with  "well-separated"  geometry,  l.e. ,  each 
combatant  Initially  well  out  of  reach  of  the  other's  weaponry.  The  characteristics 
of  the  two  aircraft  of  Ref.  1  will  be  used  for  example  computations.  As  Is  usual 


33 


with  gaming  logic,  the  tactics  emerging  are  sharply  tailored  to  differences  in  the 
characteristics  of  the  opposing  vehicles  and,  in  this  sense,  the  findings  are  spe¬ 
cialized.  Yet  the  approach  for  a  general  pair  of  combatants  will  be  apparent. 

The  altitude-choice  logic  emerging  is  of  interest  for  use  in  future  application  of 
gaming  tactics  to  3-D  point-mass  simulations. 


Modeling  attractive  for  gaming  choice  of  altitude  in  the  well-separated  case 
is  that  of  Ref.  13,  based  upon  the  Game  of  Two  Cars.  The  scenario  has  the  vehicles 
role -designated  and  Initially  arrayed  llne-astern.  The  evader  allows  the  pursuer  to 
approach  to  a  certain  distance,  meanwhile  adjusting  his  own  altitude;  then  he  executes 
a  breakaway  maneuver.  If  the  evader's  normal  acceleration  capability  exceeds  that 
of  the  pursuer,  a  miss  develops  whose  horizontal  component  is  estimated  by  Eq.  (7) 
of  Ref.  13: 
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2  V  u> 

2  2 


, -  V 

1  \  -2-L  n  _I 

2  \  2  J 


(14) 


where  V  and  V.  are  true  airspeeds  of  evader  and  pursuer,  respectively,  and  tu. 

I  i  1 

and  are  their  respective  turn  rates.  The  vertical  component  of  the  miss  is  the 
difference  in  altitudes,  chosen  by  the  pursuer.  Two  limiting  cases  for  the  vehicle 
speeds  were  examined  in  Ref.  13,  the  case  of  marginal  overtaking  speed  and  the  case 
of  a  speed  advantage  chosen  by  the  pursuer  for  maximum  normal  acceleration.  A 
speed  assumption  appropriate  to  the  present  application  is  that  both  aircraft  speeds 
are  determined  from  current  specific-energy  values  and  choices  of  altitude.  The 
pursuer's  altitude -speed  choice,  constrained  by  a  requirement  that  his  speed  equal 
or  exceed  the  evader's,  is  made  so  as  to  minimize  the  root -sum -square  of  horizontal 
and  vertical  miss  components. 


If  the  joint  state  vector  is  within  the  "draw"  region,  an  extended -duration 
chase  is  likely  and.  In  this  case,  sustainable-acceleration  figures  are  appropriate 
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for  miss  estimation.  On  the  other  hand,  maximum-instantaneous -acceleration 
figures  are  applicable  within  a  capture  region  In  state  space,  or  near  Its 
boundary. 

Consider  first  the  case  of  specific  energies  of  the  example  aircraft  equal 
at  31K,  extended -duration  assumptions,  Aircraft  #2  the  pursuer  *nri  #1  the 
evader.  In  this  setting,  the  relatively  clean  and  light  Aircraft  #1  takes  the 
chase  to  low  altitude,  drawing  #2  down  near  his  flight  envelope's  lower-llmlt 
altitude,  approximately  4000  ft. ,  generating  a  substantial  sustainable-g  de¬ 
ficiency  and  miss,  about  2300  ft.  (Fig.  6) .  If  the  pursuer  has  less  specific 
energy  than  the  evader,  he  finds  himself  worse  off,  as  the  requirement  to  at 
least  match  his  speed  to  that  of  the  evader  forces  him  to  descend  below  the  evader 
to  a  region  of  high  dynamic  pressure  and  high  drag.  In  the  case  of  pursuer  energy 
higher  than  the  evader,  computations  by  rote  predict  a  serious  problem  by  virtue 
of  high  drag;  however,  a  little  reflection  shows  that  excess  energy  Is  easily  dis¬ 
posable,  In  the  particular  scenario,  by  throttling  and  speed-braking,  and  that, 
furthermore ,  some  can  be  traded  advantageously  for  hlgher-than-sustalnable 
turn  rate.  So  the  worst  case  for  the  evader  Is  pursuer  energy  slightly  higher  than 
his  own. 

In  the  case  of  not -quite -so -well -separated  geometry,  capture  possibly 
Imminent,  similar  miss  computations  carried  out  with  maximum  Instantaneous 
turn  rates  are  of  Interest.  The  pursuing  Aircraft  #2's  weight  disadvantage 
enters  the  miss  computations,  but  his  drag  disadvantage  does  not.  However, 
when  the  miss  Is  small  or  zero,  the  drag  disadvantage,  of  course,  Influences 
the  energy-rate  comparison  that  decides  the  evasion  altitude.  With  equal  ener¬ 
gies,  the  miss  Is  small  over  the  entire  altitude  range;  the  maximum  Is  300  ft. 
at  the  upper  limit  of  the  flight  envelope,  approximately  27,000  ft. ,  where  there 
Is  a  S00  ft.  /sec.  energy-rate  margin  In  favor  of  the  evader.  With  a  pursuer 


energy  disadvantage,  the  greatest  miss  occurs  for  the  same  hlgh-altltude  choice, 
and  the  miss  slightly  exceeds  the  energy  difference.  If  the  pursuer  has  an  energy 
advantage,  descent  to  the  lower  reaches  of  the  envelope  becomes  attractive,  pri¬ 
marily  on  energy  rate. 

With  the  relatively  heavy  and  aerodynamlcally  dirty  Aircraft  #2  as  evader, 
dominating  the  choice  of  altitude,  well -separated  engagements  find  the  evader 
moving  aloft  to  the  top  of  this  flight  envelope.  With  energies  equal  to  3 IK  ,  or 
with  a  pursuer  energy  advantage,  the  miss  estimate,  employing  sustainable  turn 
rates,  is  zero  over  the  whole  altitude  range  and  the  evader's  choice  of  the  upper 
end  is  made  by  comparing  energy  rates.  If  the  pursuer  has  an  energy  deficiency, 
the  miss  is  constant  at  the  value  of  the  energy  difference  over  the  altitude  range, 
and  again  the  energy-rate  balance  decides.  The  balance  is  unfavorable  to  the 
evader  over  the  whole  altitude  range,  but  it  is  least  unfavorable  at  the  high  end. 
Similar  results  emerge  when  maxlmum-instantaneous-rate  turns  are  assumed. 

The  long-term  trend  in  the  draw  space  Is  against  the  evader. 


SECTION  XVI 


POINT-MASS  SIMULATION 


The  equations  of  motion,  data  representation,  and  control  logic  for  the  one- 
on-one  point -mass -modeled  air-combat  simulation  are  described  In  the  following. 


1.  EQUATIONS  OF  MOTION 

Motion  of  each  aircraft  satisfies  a  constant-mass  three-degree-of-freedom 
system  of  differential  equations  as  follows: 


h 

y 

E 


V  sin  y 

g  cos  u 
WV 


(L-T 


V  T^  cos  a 


sin  or)  -  —  cos  v 

VD 


W  W 


glL  +  T^  sin  or)  sin  u 
V  W  cos  y 

V  cos  y  sin  \ 

V  cos  y  cos  x 

v2 

Here  h  Is  altitude,  y  flight-path  angle  to  the  horizontal,  E»h  +  —  specific 
energy,  V  velocity  magnitude,  x  beading  angle,  x  and  y  horizontal  position 
components,  u  bank  angle,  T  maximum  thrust,  77,  0*rj*l,  throttle  setting, 
L  lift,  D  drag,  and  a  angle  of  attack.  Geometry  (n  the  horizontal  plane,  as 
viewed  from  above.  Is  shown  In  the  following  sketch. 
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The  bearing  angle  9  is  defined  by  eq.  (11). 


2.  DATA  REPRESENTATION 

Maximum  engine  thrust  T(M  ,  h)  is  represented  as  a  function  of  Mach 
number  M  and  altitude  h  via  double -table -lookup  featuring  a  rectangular  array 
and  cubic -spline-lattice  interpolation  (Ref.  14).  Lift  and  drag  are  given  by 

L  =  qsc^  *  qsc^  a 

a 

D  •  Ws/ 
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The  lift  is  modeled  as  linear  with  angle  of  attack,  a  ,  and  the  drag  quadratic  with 
lift  coefficient  Is  drag  Increment  due  to  fully  extended  speed  brakes; 

the  brake  setting  Is  o,  0  *  a  £  1  .  Zero-lift  drag  coefficient  C^(M)  Is  represented 

o 

as  a  tabular  function  of  M  ,  with  cubtc-spllne  Interpolation.  The  drag  Increment 

Is  taken  as  a  fixed  fraction  (Input)  of  .  The  drag-due-to-llft  coefficient 

o 

(M)  Is  a  tabular  function  of  M  with  spline  Interpolation. 


3.  BANK-ANGLE  LOGIC 


For  a  hard-turning  mode,  the  lift  coefficient  is  taken  as 


.  f  '  n  W  \ 

min  CL  ,  — — 

^  L  ’  qS  / 


Here  C^(M)  is  maximum  lift  coefficient,  a  splined  tabular  function  of  M  ,  and  x 
Is  structural  limit  load  factor. 

For  a  sustainable-turn  mode,  is  taken  as 

/  •  n  W  \ 

cl  =  min  cL-^i-*  °l  ; 

SS  8 

where  Cj  Is  the  lift  coefficient  for  sustainable  turning,  approximated  as  that 
for  T=>D? 


-s\2  v 

i.  n. 


With  Ax  *  X2  “  xj  •  Ay  =  >’2  “  yl  •  and  Ah  =  h9  -  h^  ,  the  pursuit  plane 
for  Vehicle  1  is  defined  as  the  plane  containing  the  line  of  sight  and  the  opponent's 
velocity  vector.  The  slope  of  the  plane  containing  the  line  of  sight  and  Vehicle  2's 
velocity  vector  Is 


The  component  of  gravity  normal  to  this  plane 


is  %  I yr+" 


and  the 


bank  angle  ^  required  to  balance  this  component  of  gravity  Is 


cos  n 


<  L1  +  sln  °1  W  1  +8o  cos 


W1 

^  /  ■■■  -  >  1  .  cos  =  1  .  Here  L  and  o 

(  Tj  sin  )  J 1  +  s  “  cos  y  p  11 

are  the  lift  and  angle  of  attack  of  the  first  vehicle.  The  sense  of  the  bank  angle 

* 

ulp  is  determined  by 


PlpS  °  If  ff*  Vl-9  +  ff  *  0 

pl*p  2  0  ,f  o*v  »  + 

where  the  test  quantity  has  been  defined  to  lie  In  the  range  tr  *  Xj-0  +  ir  *  -n 
by  the  addition  of  an  appropriate  Integer  multiple  of  2  tr . 
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The  corresponding  definitions  of  a  and  n*  are  given  by 

x  *•  p 


(Ah  yx  -  Ay  hx  f  ♦  (Ax  hj  -  Ah  xl  f 


(Ax  yl  -  Ay  x1) 


W„ 

*  2  „  , 
cos  u„  =  -  i  1 

“P  /  2~ " 

(L2+T^  sin  or 0)  VI  +8j  cos  v2 

W„ 

2  * 

-  >  1  ,  COS  un  =  1  . 

/  ~  “P 

(L2  +  x,  sin  tt,>  y  1  *Sj  cosv2 

«2p  20  " 

*;p<o «  *2  »2-fl  2  ° 


It  is  of  future  interest  to  construct  a  one-on-one  point-mass-modeled  simu¬ 
lation  program  employing  control  logic  along  the  general  lines  of  Ref.  1  and  the 
present  section,  i.e.  ,  control  policies  patched  together  selectively  from  precal¬ 
culated  gaming  computations  carried  out  with  various  simplified  models.  2-D 
Game-of-Two-Cars  modeling,  as  In  Ref.  1,  will  apply  for  nearly-equal  energies 
and  well-determined  roles,  l.e. ,  within  the  capture  regions  of  active-cell  struc¬ 
ture  which  has  been  precalculated  at  two  or  more  equal-energy  points  assuming 
"corridor"  altitudes.  The  reference  plane  ot  the  2-D  maneuvering  will  not,  in 
general,  be  horizontal  but  will  derive  from  3-D  orientation  of  velocity  vectors. 
When  one  combatant  has  an  energy  advantage,  the  evader's  altitude  command  will 
be  biased  in  the  direction  of  the  altitude-choice  logic  of  the  preceding  section,  for 
well-separated  geometry,  and  similar  logic  worked  out  for  close-in  geometries 
using  "scissors"  and  "turning-game"  (Ref.  3)  modeling. 


41 


SECTION  XVII 


CELL  BUILDUP  AND  EVALUATION  PROGRAM 

Four  turn  rates  are  input  for  Vehicle  #1 :  maximum  instantaneous,  maxi¬ 
mum  sustainable,  zero  ("straight-flight"  option),  and  maximum  instantaneous 
again  ("turn-and-dash-away"  option).  These  are  calculated  separately,  with  velocity 
values  corresponding  to  constant  specific-energy,  altitude  unrestricted.  There  are 
three  turn  rates  for  Vehicle  #2  ;  the  "straight-flight"  option  is  missing.  If  the 
best-turn  and  best -dash  altitudes  for  the  two  opposing  aircraft  do  not  come  out  rea¬ 
sonably  close  (as  they  have  in  the  examples  reported),  a  compromise  is  required 
to  force  the  scenario  into  2-D. 

The  program  flows  in  the  following  manner.  Input  is  read  in  via  NAME- 
LIST  INP  and  then  written  out.  If  IPC  equals  zero,  there  are  no  existing  cells 
and  the  program  proceeds  to  generate  a  set  of  cells  according  to  input  specifications 
with  NRT  ,  N'XTl  ,  and  NXT2  determining  the  number  of  cells  to  be  generated. 
Using  the  initial  conditions  of  the  centers  of  all  the  cells,  the  capture  set  is  found. 

If  the  center  of  a  cell  results  in  a  mutual  capture,  the  cell  is  assigned  a  3  for 
capture  and  0  for  guidance,  then  is  added  to  the  capture  set.  If  the  center  of  a 
cell  indicates  a  1  or  a  2  capture,  then  12  three-second  trajectories  are  inte¬ 
grated  and,  depending  upon  input,  either  a  mlnmax  or  maxmin  solution  of  the  re¬ 
sultant  matrix  determines  the  row  and  column  to  be  used  for  guidance,  filling  all 
ten  layers  of  the  cells  with  these  values.  This  completes  the  capture  set.  The 
rest  of  the  cells  are  generated,  layer  by  layer,  using  different  sets  of  trajectories. 
Beginning  with  starting  time,  TZERO,  equal  to  the  shortest  tlme-to-go  value  at 
the  outset,  it  is  then  successively  decreased  by  three-second  Increments  until 
TZERO  =  0 . 


It  is  not  necessary  to  build  all  the  layers  in  one  run.  The  cells  are  saved, 
after  each  layer  has  been  completed,  in  a  temporary’  file  called  TAPE1  Internally. 


The  cell  buildup  terminates  after  an  Input  number  of  layers  have  been  accumulated 
(maximum  =  10).  During  this  time,  primary  as  well  as  secondary,  or  reprisal, 
cells  are  built  (according  to  an  input  trigger).  If  the  cells  are  being  built  piecemeal, 
one  has  to  make  sure  that  the  various  options  used  are  exactly  alike,  to  avoid  get- 
ting  a  mixed  bag.  (See  INPUT  section:  IFN  ,  LGI ,  MTPL  ,  MMED  ,  specifically.) 
NAMELIST  INP2  is  used  when  the  cell  buildup  is  a  continuation  of  a  previous  partial 
buildup  or  If  evaluation  trajectories  are  required.  Some  items  are  repeated  in  both 
namelists  In  order  to  facilitate  printing  of  input  data. 

Three  different  evaluation  runs  are  made. 

1.  Evaluates  all  combinations  of  initial  y^'s  and  \2's  from 
Input  tables  of  \  ^  and  y  . 

2.  Evaluates  trajectories  starting  In  the  center  of  selective  cells. 

3.  Evaluates  trajectories  starting  In  the  center  of  each  cell. 

Each  run  prints  a  matrix  of  predicted  outcomes  vs.  actual  results  In  addition  to  a 
summary  total  for  each  capture  type.  A  diagonal  matrix  Indicates  that  all  the  pre- 
llctlons  were  satisfied. 

The  Input  required  for  use  of  the  program  is  defined  in  the  following  section. 
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SECTION  xvm 


INPUT 


Definitions  of  the  various  input  quantities  required  for  operation  of  the  cell 
buildup  and  evaluation  program  are  listed  below. 


NAMELIST  INP 

XI 

YI 

X2 

Y2 

C 

NPRN 

DELI 

RIB 

R21B 

R22B 

SIG1 

SIG21 

SIG22 

NEQ 

V1T(12) 

XBDIT(12) 


*  x  component  of  Vehicle  #1 

*  y  component  of  Vehicle  »1 
=  x  component  of  Vehicle  »2 

*  y  component  of  Vehicle  *2 

x  navigation  ratio;  used  in  x  ’ 

=  incremental  number  of  Integration  steps  -  used  to 
obtain  a  detailed  printout  every  NPRN  Integration 
steps 

=  Initial  Integration  Interval 

x  radius  of  capture  for  Vehicle  *1 

=  radius  of  capture  of  Weapon  1  for  Vehicle  *2 

=  radius  of  capture  of  Weapon  2  for  Vehicle  *2 

■  angle  of  capture  for  Vehicle  *1 

*  angle  of  capture  of  Weapon  1  for  Vehicle  *2 

*  angle  of  capture  of  Weapon  2  for  Vehicle  *2 

*  number  of  equations  to  be  Integrated 

*  velocities  for  Vehicle  #1  -  one  for  each  element 
In  the  matrix 

■  x  's  *or  Vehicle  *1 


Default 

0. 

0. 

0. 

0. 

0. 

10000. 


0. 

12152.231 
12152.231 
9114. 173248 
10? 


10? 


60? 


6 

0. 


0. 
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Default 


LCOP 

m 

1  or  2  -  minmax  is  used  for  score 

0 

m 

3  or  4  -  maxmin  is  used  for  score 

RTM*5) 

m 

radii  representing  midpoints  of  cells 

0. 

XITM<14) 

- 

values  of  for  midpoints  of  cells 

0. 

X2TM(10> 

- 

values  of  for  midpoints  of  cells 

0. 

IFN 

a 

0  -  use  reprisal  set  of  cells 

1 

B 

1  -  use  primary  and  secondary  cells 

a 

2  -  use  primary  cells  only 

COM2 

» 

constant  for  choosing  Q*  logic  during  minmax  or 
maxmin  evaluations 

1  9 

LGI 

* 

0  -  normal  guidance 

0 

* 

1  -  override  with  first  column  guidance 

3 

2  -  override  with  first  row  guidance 

MTPL 

s 

0  -  normal  buildup 

0 

a 

3  -  first  column  only 

■ 

4  -  first  row  only 

MMID 

3 

0  -  no  reprisal  cells 

0 

a 

3  -  reprisal  buildup  with  first  column  guidance 

a 

4  -  reprisal  buildup  with  first  row  guidance 

K3 

a 

number  of  seconds  for  blinding  contestants 

3 
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NAMELIST  INP2 


Default 


ITCP 

LCNP 

LCN 

MGLTR 

K3 

LGI 


IFN 


TZRN 


MMID 


MTPL 


number  of  last  layer  generated  for  pickup  of 
cell  buildup 

number  of  cells  In  capture  set 
total  number  of  cells  generated  ♦  l 
0  -  normal  run 

7  -  evaluation  starting  from  center  of  cells 
number  of  seconds  for  blinding  contestants 

0  -  normal  guidance 

1  -  override  with  first  column  guidance 

2  -  override  with  first  row  guidance 

0  -  use  reprisal  set  of  cells 

1  -  use  primary  and  secondary  cells 

2  -  use  primary  cells  only 

starting  and  zero  times  for  evaluation  runs 
0  -  no  reprisal  cells 

3  -  reprisal  buildup  with  first  column  guidance 

4  -  reprisal  buildup  with  first  row  guidance 

*  0  -  normal  buildup 

■  3  -  first  column  only 

*  4  -  first  row  onlv 


0 

0 

2 

0 


3 

0 


l 


0. 


0 


0 
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SECTION  XIX 


CONCLUDING  REMARKS 

The  use  of  the  preference-ordered  gaming  model  In  the  present  application 
\*as  Inspired  by  manned  simulation  (Ref.  7)  results  obtained  on  a  two-cockpit  simu¬ 
lator  exploring  the  potential  of  large-envelope  wide-angle  weaponry.  The  sparse, 
but  Impressive,  results  suggested  that  role-determination  decisions  are  crucial 
and  that  engagements,  when  decisive,  are  over  very  quickly  with  such  weaponry. 

The  preference-ordered  gaming  results  correlated  qualitatively  with  those  of  the 
manned  simulations.  The  gaming  model  seems  well  suited  to  the  study  of  next- 
generation  air-to-air  weaponry,  which  tends  to  be  large-envelope,  wide-angle, 

high  P  ,  all-aspect. 

K 

The  results  of  the  present  study  suggest  that  departures  from  optimality  in 
the  details  of  air-combat  maneuvers  are  not  as  important  as  are  mistakes  in  deciding 
whether  to  attack,  to  flee,  or  to  maneuver  for  an  improvement  in  the  situation  relative 
to  one's  opponent  without  committing  one’s  self  to  an  attack.  A  rationale  for  role- 
determination.  and  for  maneuvering  In  draw  situations  is  furnished  by  the  "threat- 
reciprocity"  concept.  The  preference-ordered  discrete-gaming  computational  approach 
In  combination  with  "reprisal"  technique  for  exploiting  the  mistakes  of  one's  opponent 
by  extrapolation  seems  promising  for  applications  work. 

The  short-term  emphasis  in  future  work  should  be  upon  streamlining  the 
computations  and  upon  systematic  exploration  of  mesh-size  effects,  mesh-interpolatlon, 
and  the  effect  of  Informational  advantage.  A  development  of  particular  interest  is  the 
blending  of  the  2-D  cell-structure  control  logic  with  altitude  logic,  along  the  lines  dis¬ 
cussed  herein.  Into  a  3-D  point-mass  digital  simulation.  This  would  provide  a  simu¬ 
lation  model  faithful  to  design  details  with  decision  and  control  logic  having  a  basis  in 
game  theory,  a  combination  hitherto  unrealized.  In  conjunction  with  manned  simulation. 
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the  model  would  likely  prove  useful  for  study  of  air  tactics,  and  possibly  for  training. 
Additional  control  logic  could  be  provided  In  the  cell  structure,  to  permit  extrapola¬ 
tion  of  opponent's  errors,  particularly  role-decision  blunders,  for  various  assumed 
periods  in  study  and  training  exercises. 

An  exciting  possibility  Is  application  to  2-on-l  and,  ultimately,  to  many-on- 
many.  The  advances  presently  reported  fall  precisely  In  the  weakest  area  of  existing 
air-combat  simulation  technology’,  viz. ,  control  logic.  Serious  enough  for  1-on-l , 
this  weakness  becomes  overwhelming  for  the  many-on-many  case.  The  attractive 
approach  Is  the  use  of  1-on-l  results  for  instantaneous  evaluation  of  the  threat  posed 
by  each  vehicle  In  the  fray  to  each  opponent,  and  the  rational  assignment  of  roles 
and  Individual  opponents,  with  particular  regard  to  weaknesses  of,  and  tactical  blun¬ 
ders  by,  the  opposition. 
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Evader 


Figure  6.  Breakaway  Mias  vs.  Altitude 
Sustainable  Turns 
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